Workshop Welcome and Introduction
8:30-8:45
Matteo Golfarelli
The ACM Fifteenth International Workshop on Data Warehousing and OLAP (15 mins)
Invited Talk
Chair: Il-Yeol Song
8:45-9:30
- Pekka Kostamaa (Teradata)
Efficient Big Data Analytics using SQL and Map-Reduce (45 mins)
Big Data is the current hot topic in data analytics. In addition to the traditional, relational data, new semi-structured and unstructured data volumes are growing at a rapid pace.
Big Data is not just about Volume, but also about Velocity and Variety of the data. This means that the amount of data to be stored for analysis is growing, the speed at which it
is generated is increasing, and the variety of the data is changing, sometimes dynamically. Traditional data processing methods using relational database technologies are not optimal
to perform efficient analytics against the Big Data types. This data needs to be analyzed together with the data warehouse data, which typically resides in a relational database.
Two popular means of doing these analytics are the declarative language SQL with the relational data, and the procedural Map-Reduce programming style against the semi-structured Big Data.
The Teradata Aster platform allows a user to seamlessly combine both types of analysis using the SQL/MR technology. This talk presents real customer use cases that have provided high
value returns to the customers. We also show how the technology is used to accomplish these analytics by allowing the use of the best processing method depending on the type of processing and data.
Session 1: OLAP Query processing and Trends
Chair: Alkis Simitsis
9:30-10:45
- Bernd Neumayr, Stefan Anderlik and Michael Schrefl
Towards Ontology-based OLAP: Datalog-based Reasoning over Multidimensional Ontologies (25 mins)
- Patrick Marcel, Rokia Missaoui and Stefano Rizzi
Towards Intensional Answers to OLAP Queries for Analytical Sessions (25 mins)
- Michel De Rougemont and Phuong Thao Cao
Approximate Answers to OLAP Queries on Streaming Data Warehouses (25 mins)
Session 2: Data Warehouse Design and Maintainability
Chair: Carlos Ordonez
11:05-13:00
- Petar Jovanovic, Oscar Romero, Alkis Simitsis and Alberto Abello
ORE: An Iterative Approach to the Design and Evolution of Multi-Dimensional Schemas (25 mins)
- Svetlana Mansmann, Nafees Ur Rehman, Andreas Weiler and Marc H Scholl
Discovering OLAP dimensions in semi-structured data (25 mins)
- Nicolas Prat, Imen Megdiche and Jacky Akoka
Multidimensional Models Meet the Semantic Web: Defining and Reasoning on OWL-DL Ontologies for OLAP (25 mins)
- Alejandro Maté, Juan Trujillo, Elisa De Gregorio and Il-Yeol Song
Improving the Maintainability of Data Warehouse Designs: Modeling Relationships between Sources and Requirements (25 mins)
- Stefan Berger and Michael Schrefl
FedDW Global Schema Architect - UML-based Design Tool for the Integration of Logical Data Mart Schemas (16 mins)
Lunch break
13:00-14:00
Session 3: Performance and Benchmarking
Chair: Wu Bin
14:00-15:45
- Stephan Müller
An In-Depth Analysis of Data Aggregation Cost Factors in a Columnar In-Memory Database (25 mins)
- Chantola Kit, Marouane Hachicha and Jérôme Darmont
Benchmarking Summarizability Processing in XML Warehouses with Complex Hierarchies (16 mins)
- Craig Stanfill
Type 2 Slowly Changing Dimensions: A Case Study Using the Co>Operating System (16 mins)
- Jianting Zhang, Simin You and Le Gruenwald
High-Performance Online Spatial and Temporal Aggregations on Multi-core CPUs and Many-Core GPUs (16 mins)
- Doulkifli Boukraa, Omar Boussaid, Fadila Bentayeb and Djamel Eddine Zegour
Managing a Fragmented XML Data Cube with Oracle and Timesten (16 mins)
- Arian Baer and Lukasz Golab
Towards Benchmarking Stream Data Warehouses (16 mins)
Session 4: Warehousing complex data
Chair: Patrick Marcel
16:05-17:30
- Elio Masciari
Warehousing and Querying Trajectory Data Streams With Error Estimation (25 mins)
- Carlos Garcia-Alvarado and Carlos Ordonez
Query Processing on Cubes Mapped from Ontologies to Dimension Hierarchies (25 mins)
- Alfredo Cuzzocrea and Paolo Serafino
Enhanced Clustering of Complex Database Objects in the ClustCube Framework (16 mins)
- Mu Yin, Bin Wu and Zengfeng Zeng
HMGraph OLAP: A Novel Framework for Multi-dimensional Heterogeneous Network Analysis (16 mins)